Add unsorted decompressed chunk path even if we have sorted ones #6879

akuzm · 2024-05-03T15:04:17Z

Unsorted paths are better for hash aggregation, but currently, when we are doing aggregation and can push down the sort, we add only sorted paths.

Fixes #6836
Fixes #7084
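
To illustrate the motivation, here is a minimal toy sketch in Python. It is not the TimescaleDB planner code: the `Path` class, the cost numbers, and `cheapest_for_hash_agg` are all hypothetical. It only shows that when the candidate set contains nothing but sorted paths, a hash aggregate has to pay for an ordering it does not need.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Path:
    """Toy stand-in for a planner path (hypothetical, not a Postgres struct)."""
    name: str
    total_cost: float
    pathkeys: Tuple[str, ...] = ()  # empty tuple means "no guaranteed order"


def cheapest_for_hash_agg(paths: Sequence[Path]) -> Path:
    """Hash aggregation ignores input order, so it just wants the cheapest path."""
    return min(paths, key=lambda p: p.total_cost)


# Hypothetical costs: producing sorted output from a decompressed chunk costs extra.
sorted_path = Path("DecompressChunk (sorted)", total_cost=150.0, pathkeys=("device_id",))
unsorted_path = Path("DecompressChunk (unsorted)", total_cost=100.0)

# With only the sorted path available, hash aggregation is stuck with it.
only_sorted = cheapest_for_hash_agg([sorted_path])
# Once the unsorted path is also offered, it wins on cost.
with_unsorted = cheapest_for_hash_agg([sorted_path, unsorted_path])
```

In this toy model, `only_sorted` is the more expensive sorted path, while `with_unsorted` picks the cheaper unsorted one.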

Add ANALYZE. To keep the desired MergeAppend plans, we also have to add a LIMIT everywhere, so that MergeAppend is chosen for its lower startup cost. Otherwise, the plain Sort over Append is chosen, because its total cost is lower for small tables.
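
The effect of LIMIT on this choice can be sketched with a toy calculation in Python. The cost numbers below are made up for illustration, and `fractional_cost` and `cheaper` are hypothetical helpers; the linear interpolation between startup and total cost is, as far as I understand it, how PostgreSQL compares paths when only a fraction of the rows is fetched.

```python
def fractional_cost(startup_cost: float, total_cost: float, fraction: float) -> float:
    """Cost of fetching only `fraction` of a path's rows, by linear interpolation."""
    return startup_cost + fraction * (total_cost - startup_cost)


# Hypothetical numbers for a small table.
# Sort over Append must sort everything before it can return the first row.
sort_over_append = {"startup": 40.0, "total": 42.0}
# MergeAppend returns the first row almost immediately but costs more overall.
merge_append = {"startup": 2.0, "total": 50.0}


def cheaper(fraction: float) -> str:
    """Name of the plan shape with the lower cost for the given row fraction."""
    a = fractional_cost(sort_over_append["startup"], sort_over_append["total"], fraction)
    b = fractional_cost(merge_append["startup"], merge_append["total"], fraction)
    return "Sort over Append" if a < b else "MergeAppend"


no_limit = cheaper(1.0)     # all rows are fetched
with_limit = cheaper(0.01)  # a LIMIT that fetches about 1% of the rows
```

With these assumed numbers, the full fetch favors Sort over Append, while the limited fetch favors MergeAppend, which is why the tests need the LIMIT to keep the MergeAppend plans.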

Add ANALYZE after compression. The plan changes are expected: for small tables, SeqScans are preferred over IndexScans, and Sort over MergeAppend.

We were adding extra Sort nodes when adjusting the children of a space-partitioning MergeAppend under ChunkAppend. This is not needed, because MergeAppend plans add the required Sort themselves, and in general no adjustment seems to be required for the MergeAppend children there.

akuzm · 2024-12-19T14:16:33Z

tsl/test/expected/compression_ddl.out

+   Group Key: _hyper_31_114_chunk.device_id
+   ->  Sort
+         Sort Key: _hyper_31_114_chunk.device_id
+         ->  Gather
+               Workers Planned: 2
+               ->  Parallel Append


This should be GatherMerge above Sort; it will be addressed in #7547.

akuzm · 2024-12-19T14:22:26Z

tsl/src/nodes/decompress_chunk/decompress_chunk.c

-			/*
-			 * Check if this path is parameterized on a compressed
-			 * column. Ideally those paths wouldn't be generated
-			 * in the first place but since we create compressed
-			 * EquivalenceMembers for all EquivalenceClasses these
-			 * Paths can happen and will fail at execution since
-			 * the left and right side of the expression are not
-			 * compatible. Therefore we skip any Path that is
-			 * parameterized on a compressed column here.
-			 */


I think I fixed this some time ago: we shouldn't be creating EquivalenceMembers on the compressed columns of the compressed chunk table anymore, because they don't make sense anyway. I removed this check, and the tests for the older issues still pass.

svenklemm · 2024-12-21T10:06:43Z

Did this have any effect on planning time with many compressed chunks?

svenklemm · 2024-12-21T10:10:24Z

tsl/test/expected/merge_append_partially_compressed-15.out

@@ -200,8 +200,8 @@ generate_series(1,3) device;
                     Sort Method: top-N heapsort 
                     ->  Custom Scan (DecompressChunk) on _hyper_1_3_chunk (actual rows=30 loops=1)
                           Filter: (device = ANY ('{1,2,3}'::integer[]))
-                           ->  Index Scan using compress_hyper_2_6_chunk_device__ts_meta_min_1__ts_meta_max_idx on compress_hyper_2_6_chunk (actual rows=3 loops=1)
-                                 Index Cond: (device = ANY ('{1,2,3}'::integer[]))
+                           ->  Seq Scan on compress_hyper_2_6_chunk (actual rows=3 loops=1)


This seems like a regression? We have a constraint on the first index column, so the index should be beneficial.

For small tables, I think a Seq Scan is often chosen instead of an Index Scan. For example, we often see this change after adding ANALYZE.
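
A rough way to see why this happens is the toy cost comparison below, in Python. The constants are PostgreSQL's default planner cost settings, but the two formulas are deliberately simplified assumptions, not the real `cost_seqscan` or `cost_index` logic, and the page and row counts are hypothetical.

```python
# PostgreSQL's default planner cost settings.
SEQ_PAGE_COST = 1.0
RANDOM_PAGE_COST = 4.0
CPU_TUPLE_COST = 0.01
CPU_INDEX_TUPLE_COST = 0.005
CPU_OPERATOR_COST = 0.0025


def seq_scan_cost(pages: int, rows: int) -> float:
    """Read every page sequentially and evaluate the filter on every row."""
    return pages * SEQ_PAGE_COST + rows * (CPU_TUPLE_COST + CPU_OPERATOR_COST)


def index_scan_cost(index_pages: int, heap_pages: int, matched_rows: int) -> float:
    """Simplified: random I/O for the touched index and heap pages, plus per-row CPU."""
    io = (index_pages + heap_pages) * RANDOM_PAGE_COST
    cpu = matched_rows * (CPU_INDEX_TUPLE_COST + CPU_TUPLE_COST)
    return io + cpu


# Hypothetical tiny compressed chunk: one heap page, three rows, all of them match.
small_seq = seq_scan_cost(pages=1, rows=3)
small_idx = index_scan_cost(index_pages=1, heap_pages=1, matched_rows=3)

# Hypothetical large table: the same three matching rows among a million.
large_seq = seq_scan_cost(pages=10_000, rows=1_000_000)
large_idx = index_scan_cost(index_pages=3, heap_pages=3, matched_rows=3)
```

Under these assumptions, the sequential scan is cheaper for the one-page table and the index scan is cheaper for the large one, which matches the plan change seen in the test output once ANALYZE tells the planner how small the table is.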

akuzm · 2024-12-23T17:24:07Z

> Did this have any effect on planning time with many compressed chunks?

There's a 5% regression on a couple of queries in the planning suite. I'll see if I can optimize this somehow. There were also some changes in the "ordered_append_planning" suite, but those are actually execution time changes. I verified this manually and changed the queries to use EXPLAIN.

https://grafana.ops.savannah-dev.timescale.com/d/fasYic_4z/compare-akuzm?orgId=1&var-branch=All&var-run1=3997&var-run2=4018&var-threshold=0.02&var-use_historical_thresholds=true&var-threshold_expression=2%20%2A%20percentile_cont%280.90%29&var-exact_suite_version=false&from=now-2d&to=now

The reason for the execution time change is that Gather Merge -> Sort -> Parallel Append -> DecompressChunk -> Parallel Seq Scan is chosen over Merge Append -> Sort -> DecompressChunk -> Seq Scan. This is because Postgres doesn't support Gather Merge -> Merge Append, as I mentioned on Slack before. This is probably something we can improve in the future.

This happens for queries like SELECT * FROM space_part ORDER BY time DESC, a LIMIT 1;

akuzm added 30 commits May 3, 2024 17:03

Add unsorted decompressed chunk path even if we have sorted ones

76c1f97

The unsorted paths are better for hash aggregation, but currently in this case we are only going to add sorted paths.

label the path w/o sorting with the proper pathkeys

868d253

ref

eef6db3

set all parameters

ca9d1c3

accept the transparent_decompression ref

f7b1ec5

simplify?

848c6ee

tmp debug

397717d

reference merge_append_partially_compressed-*

7643343

reference REL_14_9-96-g162b38a068 merge_append_partially_compressed-*

4f5b34f

reference REL_13_9 merge_append_partially_compressed-*

68062c7

Fix some flakiness in transparent_decompression test

2767664

Add ANALYZE after compression. The plan changes are expected: for small tables, SeqScans are preferred over IndexScans, and Sort over MergeAppend.

reference REL_13_9 transparent_decompression-*

f0f4ebc

reference REL_16_0-116-g67738dbf9c transparent_decompression-*

c8b1bac

reference REL_14_9-96-g162b38a068 transparent_decompression-*

5f2764f

add vacuum as well

06c087b

reference REL_16_1 transparent_decompression-*

5d44042

reference REL_14_11 transparent_decompression-*

3476da5

reference REL_13_9 transparent_decompression-*

5ffd24a

Merge remote-tracking branch 'akuzm/flaky-merge' into HEAD

4fba0f5

capitalization

5bbf5df

add vacuum

2670505

vacuum analyze

3b62a11

Merge remote-tracking branch 'akuzm/transparent-flaky' into HEAD

d7e656a

try to make less invasive changes to prevent unexplainable flakiness in

f92a7ca

the CI

Merge remote-tracking branch 'akuzm/transparent-flaky' into HEAD

b04d9fb

Merge remote-tracking branch 'akuzm/flaky-merge' into HEAD

6be134e

Merge remote-tracking branch 'akuzm/double-sort' into HEAD

5e496c0

bigger table?

0859a3a

akuzm added 3 commits December 17, 2024 14:21

reference REL_17_0-80-gb7467ab71c transparent_decompression-* merge_a…

c5ace45

…ppend_partially_compressed-* ordered_append-*

reference REL_14_11 transparent_decompression-* merge_append_partiall…

e592158

…y_compressed-* ordered_append-*

reference REL_16_4-111-g925b3aa857 transparent_decompression-* merge_…

65fdd96

…append_partially_compressed-* ordered_append-*

svenklemm reviewed Dec 17, 2024

src/planner/planner.h

akuzm added 11 commits December 17, 2024 15:06

more const

1812105

remove the macro

757fb52

make a separate function

3e7ec2d

brrace

8b0a093

unify parallel path handling

8010a6b

benchmark unsorted paths (2024-12-17 no. 2)

8528478

fix the minmax initplan cost

ae8149a

benchmark unsorted paths (2024-12-18 no. 3)

81d4d85

some fixes and gather over sort

6ca7143

Revert "Chunk-wise agg: add Gather above Sort"

21bfc85

This reverts commit e94bd26.

accept the refs

3916fe0

akuzm commented Dec 19, 2024

svenklemm reviewed Dec 21, 2024

akuzm added 3 commits December 23, 2024 13:37

benchmark unsorted paths (2024-12-23 no. 5)

e834b23

create parallel index paths for compressed table

a084f90

benchmark unsorted paths (2024-12-23 no. 6)

f790af4

akuzm added 7 commits December 23, 2024 18:28

benchmark unsorted paths (2024-12-23 no. 8)

96a129b

fix

ad804a3

optimizations 1

b3beeda

optimization 2

c3287de

optimizations.....

8db0166

batch sorted merge

d4825e4

benchmark unsorted paths (2024-12-24 no. 9)

1c81dc1
